Latent Block Model for Contingency Table

نویسندگان

  • Gérard Govaert
  • Mohamed Nadif
چکیده

Although many clustering procedures aim to construct an optimal partition of objects or, sometimes, of variables, there are other methods, called block clustering methods, which consider simultaneously the two sets and organize the data into homogeneous blocks. This kind of methods has practical importance in a wide of variety of applications such as text and market basket data analysis. Typically, the data that arises in these applications is arranged as two-way contingency table. Using Poisson distributions, a latent block model for these data is proposed and, setting it under the maximum likelihood approach and the classification maximum likelihood approach, various algorithms are proposed. Their performances are evaluated and compared to a simple use of EM or CEM applied separately on the rows and columns of the contingency table.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New phase II control chart for monitoring ordinal contingency table based processes

In some statistical process monitoring applications, quality of a process or product is described by more than one ordinal factors called ordinal multivariate process. To show the relationship between these factors, an ordinal contingency table is used and modeled with ordinal log-linear model. In this paper, a new control charts based on ordinal-normal statistic is developed to monitor the ord...

متن کامل

Reduced rank models for contingency tables

In recent years much attention has been given to models for two-way contingency tables that can be formulated in terms of reduced rank of a matrix with probabilities. A well-known reduced rank model is the independence model, where the rank is one. For rank higher than one distinct classes of reduced rank models are possible. Each has the independence model as the special case for rank one. A f...

متن کامل

A Parametric Bootstrap Procedure to Perform Statistical Tests in a LCA of Anti-Social Behaviour

In this paper we focus on latent class analysis of a large data set. The data deal with antisocial behaviour which is measured by 24 dichotomous variables. Latent class analysis of such huge data sets is unusual. One of the reasons is that latent class analysis defines models for contingency tables, so, if there is a set of k dichotomous items, then a table of 2 cells is modeled. It will be cle...

متن کامل

Maximum Likelihood Estimation in Latent Class Models For Contingency Table Data

Statistical models with latent structure have a history going back to the 1950s and have seen widespread use in the social sciences and, more recently, in computational biology and in machine learning. Here we study the basic latent class model proposed originally by the sociologist Paul F. Lazarfeld for categorical variables, and we explain its geometric structure. We draw parallels between th...

متن کامل

Non-parametric latent modeling and network clustering

The paper exposes a non-parametric approach to latent and co-latent modeling of bivariate data, based upon alternating minimization of the Kullback-Leibler divergence (EM algorithm) for complete log-linear models. For categorical data, the iterative algorithm generates a soft clustering of both rows and columns of the contingency table. Well-known results are systematically revisited, and some ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010